Search for: All records

Creators/Authors contains: "Williamson, Donald S."

From the perspective of perceptual speech quality: The robustness of frequency bands to noise

https://doi.org/10.1121/10.0025272

Fan, Junyi ; Williamson, Donald S. ( March 2024 , The Journal of the Acoustical Society of America)

Speech quality is one of the main foci of speech-related research, where it is frequently studied with speech intelligibility, another essential measurement. Band-level perceptual speech intelligibility, however, has been studied frequently, whereas speech quality has not been thoroughly analyzed. In this paper, a Multiple Stimuli With Hidden Reference and Anchor (MUSHRA) inspired approach was proposed to study the individual robustness of frequency bands to noise with perceptual speech quality as the measure. Speech signals were filtered into thirty-two frequency bands with compromising real-world noise employed at different signal-to-noise ratios. Robustness-to-noise indices of individual frequency bands were calculated based on the human-rated perceptual quality scores assigned to the reconstructed noisy speech signals. Trends in the results suggest the mid-frequency region appeared less robust to noise in terms of perceptual speech quality. These findings suggest future research aiming at improving speech quality should pay more attention to the mid-frequency region of the speech signals accordingly.

Free, publicly-accessible full text available March 1, 2025
Attention-Based Speech Enhancement Using Human Quality Perception Modeling

https://doi.org/10.1109/TASLP.2023.3328282

Nayem, Khandokar Md. ; Williamson, Donald S. ( January 2024 , IEEE/ACM Transactions on Audio, Speech, and Language Processing)

Free, publicly-accessible full text available January 1, 2025
An End-To-End Non-Intrusive Model for Subjective and Objective Real-World Speech Assessment Using a Multi-Task Framework

https://doi.org/10.1109/ICASSP39728.2021.9414182

Zhang, Zhuohuang ; Vyas, Piyush ; Dong, Xuan ; Williamson, Donald S. ( June 2021 , IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP))
Full Text Available
Towards real-world objective speech quality and intelligibility assessment using speech-enhancement residuals and convolutional long short-term memory networks

https://doi.org/10.1121/10.0002702

Dong, Xuan ; Williamson, Donald S. ( November 2020 , The Journal of the Acoustical Society of America)
Full Text Available
A Pyramid Recurrent Network for Predicting Crowdsourced Speech-Quality Ratings of Real-World Signals

https://doi.org/10.21437/Interspeech.2020-2809

Dong, Xuan ; Williamson, Donald S. ( October 2020 , Proc. Interspeech 2020)
Full Text Available
Defending Against Microphone-Based Attacks with Personalized Noise

https://doi.org/10.2478/popets-2021-0021

Liu, Yuchen ; Xiang, Ziyu ; Seong, Eun Ji ; Kapadia, Apu ; Williamson, Donald S. ( January 2021 , Proceedings on Privacy Enhancing Technologies)
Voice-activated commands have become a key feature of popular devices such as smartphones, home assistants, and wearables. For convenience, many people configure their devices to be ‘always on’ and listening for voice commands from the user using a trigger phrase such as “Hey Siri,” “Okay Google,” or “Alexa.” However, false positives for these triggers often result in privacy violations with conversations being inadvertently uploaded to the cloud. In addition, malware that can record one’s conversations remains a significant threat to privacy. Unlike with cameras, which people can physically obscure and be assured of their privacy, people do not have a way of knowing whether their microphone is indeed off and are left with no tangible defenses against voice based attacks. We envision a general-purpose physical defense that uses a speaker to inject specialized obfuscating ‘babble noise’ into the microphones of devices to protect against automated and human based attacks. We present a comprehensive study of how specially crafted, personalized ‘babble’ noise (‘MyBabble’) can be effective at moderate signal-to-noise ratios and can provide a viable defense against microphone based eavesdropping attacks.
Full Text Available
An Attention Enhanced Multi-Task Model for Objective Speech Assessment in Real-World Environments

https://doi.org/10.1109/ICASSP40776.2020.9053366

Dong, Xuan ; Williamson, Donald S. ( May 2020 , ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing)

Full Text Available
A Classification-Aided Framework for Non-Intrusive Speech Quality Assessment

https://doi.org/10.1109/WASPAA.2019.8937192

Dong, Xuan ; Williamson, Donald S. ( October 2019 , 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA))

Full Text Available